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ABSTRACT 

This paper investigates whether nonlinear gravitational instability can account for the 
clustering of galaxies on large and small scales, and for the evolution of clustering with 
epoch. The local clustering spectrum is accurately established, and bias is not a great 
source of uncertainty: the real-space power spectra of optical and IRAS galaxies show 
only a weak scale-dependent relative bias of b ~ 1.15 on large scales, increasing to 
b ~ 1.5 on the smallest scales. Comparison with results in redshift space favours a 
relatively small distortion parameter (3 opt = il°' 6 /b opt ~ 0.4. 

No CDM-like spectrum is consistent with the shape of the observed nonlinear spec- 
trum. Unbiased low-density models greatly overpredict the small-scale correlations; 
high-density models would require a bias which does not vary monotonically with scale. 
The true linear power spectrum contains a primordial feature at k ~ 0.1 /iMpc -1 , and 
must break quite abruptly to an effective slope of n < —2.3 on smaller scales 

This empirical fluctuation spectrum also fits the CFRS data on the evolution 
of clustering, provided the universe is open with Q ~ 0.3. Only this case explains 
naturally how the small-scale spectrum can evolve at the observed rate while retaining 
the same power-law index. An unbiased open model also matches correctly the large- 
scale COBE data, and offers an attractively simple picture for the phenomenology of 
galaxy clustering. 
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1 INTRODUCTION 

Studies of galaxy clustering have progressed steadily to- 
wards measurements on ever larger scales. The influential 
text by Peebles (1980) could call only on results from an- 
gular clustering in the Lick survey, which established the 
power-law form of the correlation function at separations 
< 10ft -1 Mpc (h = Ho/100 kms^Mpc -1 ). The following- 
years have yielded a succession of larger and deeper redshift 
surveys which probe clustering well into the linear regime at 
separations approaching 100/i -1 Mpc (e.g. CfAl: Davis & 
Peebles 1983; CfA2: de Lapparent et al. 1986; IRAS QDOT: 
Saunders et al. 1991; IRAS 1.2-Jy: Fisher et al. 1993; APM- 
Stromlo: Loveday et al. 1992). 

These later studies have concentrated on the shape of 
the large-scale fluctuation spectrum, for two main reasons: 
(1) the shape of the linear spectrum allows a test of funda- 
mental models for the generation of fluctuations, plus the 
modifying effects of dark matter; (2) redshift-space distor- 
tions mean that redshift surveys provide a highly distorted 
measure of clustering at separations < 1 Mpc. As long 
as there was uncertainty over the form of large-scale clus- 
tering, it therefore made sense to keep the focus in this 
area. However, a comparison of various large-scale measure- 
ments by Peacock & Dodds (1994; PD94) suggested that 
the shape of the clustering power spectrum for wavenum- 
bers k < 0.3 /iMpc" 1 was being measured consistently by 
a variety of galaxy tracers. One aim of the present paper is 



therefore to concentrate on the small-scale data, to see how 
extending the range of the clustering spectrum narrows the 
possible interpretations of the data. 

A further development which makes it timely to concen- 
trate on non-linear clustering is the evolution of the galaxy 
correlation function. The angular clustering of faint galax- 
ies has long indicated changes in £(r) with epoch (e.g. Efs- 
tathiou et al. 1991; Neuschaefer, Windhorst & Dressier 1991; 
Couch et al. 1993; Roche et al. 1993), but a more direct 
measurement has recently been made by the Canada- France 
Redshift Survey (Le Fevre et al. 1996), which yields spatial 
clustering in different redshift bins up to z = 1. Although a 
variety of different models are capable of accounting for the 
local clustering data, these alternatives make different pre- 
dictions for the change in clustering with epoch; the second 
aim of the present paper is therefore to combine the CFRS 
measurements with the data on clustering at z = 0, and to 
ask if any model can successfully account for both. 

The plan of the paper is as follows. Section 2 assembles 
the data on the real-space clustering of optical and IRAS 
galaxies, and considers their relative bias and degrees of 
redshift-space distortion. Section 3 reviews analytical meth- 
ods for obtaining nonlinear power spectra, and proposes 
some empirical models which can explain the data. Section 

4 explores the predictions for the evolution of clustering in 
these models, in comparison with the CFRS data. Section 

5 adds the requirement of consistency with fluctuations in 
the cosmic microwave background (CMB). Section 6 sums 
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up the conclusions of these studies, which suggest that a 
low-density open universe with a small degree of bias is the 
line of least resistance. 



2 THE CLUSTERING OF OPTICAL AND 
IRAS GALAXIES 

2.1 Clustering in real space 

It is not wise to assume in advance that any given class of 
galaxy traces cosmological density in an unbiased fashion, 
and we should therefore look at the properties of a variety 
of tracers. Two interesting cases for which the best data ex- 
ist are optically-selected galaxies and IRAS galaxies. Since 
we are interested in clustering at small separations, it is 
necessary to use real-space results. For optical galaxies, the 
best power-spectrum determination is from the angular cor- 
relations of the APM survey (Maddox et al. 1990; 1996), 
as deprojected by Baugh & Efstathiou (1993; 1994). For 
IRAS galaxies, the best available result is from the cross- 
correlation between the l-in-6 QDOT survey and its parent 
catalogue (Saunders, Rowan- Robinson & Lawrence 1992). 
To reduce the data to accurate estimates of the real-space 
power spectra for optical and IRAS galaxies requires a num- 
ber of steps, which are detailed below. The resulting power 
spectra are shown in Figure 1, where we follow PD94 in us- 
ing a dimensionless notation for the power spectrum: A 2 is 
the contribution to the fractional density variance per bin 
of lnfc. In the convention of Peebles (1980), this is 



A 2 (fc) = Aa 



■^k A \S k \ 2 , 



dlnfc (2tt) 3 
and the relation to the correlation function is 



dk sinfcr 



k kr 
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A dominant topic in PD94 was how to correct redshift- 
survey data for the distorting effects of peculiar velocities 
on the radial coordinate; however, a variety of projection 
schemes exist whereby the real-space correlations may be ob- 
tained in an unbiased manner. Saunders et al. (1992) used 
the angular cross-correlation between the QDOT redshift 
survey and its (larger) parent catalogue to obtain the pro- 
jected correlation function 



S(r) = 



£[(r 2 + x 2 ) 1/2 ] dx = 2 / 

- co J r 



(3) 



At first sight, it is not very attractive to use this to infer 
the power spectrum, because the window function involved 
is extremely broad: 



kr 



Jo(kr) 



(4) 



Fortunately, Saunders et al. inverted the integral relation 
between H and £ and then converted £ to a variance in cubi- 
cal cells of side £, which is readily converted to the power at 
an effective wavenumber, as discussed by Peacock (1991): 



a 2 (£) 



A 2 (fe eff ). 



(5) 



For power- law spectra, A 2 oc k n+i , with —2 < n < 0, 
u 2 (£) approximately measures A 2 (2.4/£), and the effective 
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Figure 1. The real-space power spectra of optically-selected 
APM galaxies (solid circles) and IRAS galaxies (open circles), 
obtained from the references given in the text. Except possible 
on the very largest scales, IRAS galaxies show weaker clustering, 
consistent with their suppression in high-density regions relative 
to optical galaxies. 

wavenumber can be determined even more precisely by per- 
forming the integral for <r 2 (£) over a smoothly-curving spec- 
trum of the observed form. 

The plot in Saunders et al. (1992) which contains a 2 (£) 
does not extend to very small scales, but we can recover the 
power spectrum in this regime because the small-scale cor- 
relations are very close to a single power law. Consider the 
projected correlations and power spectra for a correlation 
function of the form £ = [r/ro] 1 ■ 



r~ y ' r( 7 /2) 



l]/2) 



[r/r ] 



A 2 (k) = - T(2 - 7 ) sin ^—Jh. [ fcro p 

TV 2 



(6) 



(7) 



These relations suggest an equivalent wavenumber E(r)/r = 
A 2 (fc c ff), where k e s = 3.055/r for the 7 = 1.57 slope char- 
acteristic of the small-scale IRAS correlation function. The 
IRAS power spectrum from the Saunders et al. (1992) paper 
is therefore based on this method for k > 1 ftMpc -1 and on 
a 2 (£) for larger scales; the two techniques match very well 
where they join. 

The real-space power spectrum for optically-selected 
galaxies is available in a much more direct form from the 
work of Baugh & Efstathiou (1993, 1994). They inverted 
the integral equations relating the angular correlation func- 
tion and the angular power spectrum to the spatial power 
spectrum. The results from the first of these papers were 
used in PD94. Before using them here, however, it is advis- 
able to make one small correction. The Baugh & Efstathiou 
formalism incorporates evolution of clustering with redshift, 
but their standard results are quoted assuming no evolu- 
tion. Since the median redshift of the 17 < B < 20 samples 
they analyze is close to 0.2, this can lead to a significant 
under-estimate of the zero-redshift correlations. This can 



be checked against the recent results of the APM-Stromlo 
B < 17 redshift survey team. Loveday et al. (1995) applied 
the projection technique of Saunders et al. (1992) to deduce 
that £ = [r/5.1 /i _1 Mpc] -1 ' 71 for optically selected galaxies. 
Fourier transforming the Baugh & Efstathiou power spec- 
trum yields a correlation function which is uniformly lower 
than this by a factor of approximately 0.8 over to range 1 - K 
10 h~ 1 Mpc, and so we have re-scaled the power spectrum to i 
correct for this factor. The results produced in this way are \ 
essentially identical to those given in Maddox et al. (1996), § 
where the deprojection was performed allowing for evolu- 
tion. The errors in Baugh & Efstathiou (1993) are smaller on 
large scales than those given by Maddox et al. (1996), even 
though both are supposedly based on field-to-field scatter. 
To be conservative, the larger errors were adopted. 
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2.2 Optical-to-IRAS bias 

Figure 1 shows the well-known result that IRAS galaxies 
cluster less strongly than optical galaxies. At least one of 
these classes of galaxy must therefore be biased with respect 
to the mass. The relative bias as a function of scale can be 
defined as the square root of the ratio of the power spectra. 
This is shown in Figure 2. For the smallest scales, the rela- 
tive bias is about 1.5, but this declines with scale, reaching 
a plateau at 6 op t/&iRAs — 1-1 around k ~ 0.1 ftMpc -1 , be- 
fore declining rather abruptly below 1 on larger scales. For 
k < 0.05 AMpc -1 , according to these data, IRAS galaxies 
appear to be more clustered than optical galaxies. This is a 
suspicious feature, both for its sudden onset and because the 
known suppression of IRAS galaxies in high-density regions 
would lead us to expect IRAS galaxies to be less strongly 
clustered on all scales. Apart from this last feature, the strik- 
ing feature of the plot is the slowly-varying nature of the 
relative bias, and it would be useful to describe this by a 
fitting formula. 

A reasonable form to try is something which is a com- 
bination of linear bias and a non- linear response for A 2 > 1: 

l + A^Il + bxAjVs] 62 . (8) 

Mann, Peacock & Heavens (1996) show that this is a prac- 
tical form for describing many empirically reasonable forms 
of bias. Local modifications of iV-body density fields such as 
suppressing particles in low-density voids or weighting up 
particles in clusters produce a scale dependence which can 
be fitted by this form. The behaviour is always monoton- 
ically increasing towards small scales, as proved on rather 
general grounds for all local bias schemes by Coles (1993). 
The scale dependence can often be relatively weak, as is the 
case here, and the observed relative bias of optical and IRAS 
galaxies is therefore very much the sort of thing that would 
be expected from the known enhancement of optical galaxies 
relative to IRAS galaxies in high-density regions (Strauss et 
al. 1992). 

The best-fitting parameters in this formula are 
&i = 1.17 ±0.10 

(9) 

b 2 = 1.13 ±0.01, 

so that the linear bias parameter in the long-wavelength 
regime would be 

b= [hb 2 ] 1/2 = 1.15 ±0.05. (10) 
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Figure 2. The relative bias of optical galaxies relative to IRAS 
galaxies as a function of wavcnumber, obtained from the square 
root of the ratio of the power spectra in Fig. 1. The large-scale 
bias takes values between 1.1 and 1.2 over a decade in scale, 
0.06 < k < 0.6/iMpc -1 , before breaking sharply to values < 1 
on larger scales. 

Much the same result is obtained whether the whole range of 
the data is used, or whether the large-scale points with b < 1 
are omitted. Because the relative bias is nearly a constant 
in the region of 1.2 over a large range of k, this says that it 
is indeed implausible that the large-scale points with b < 1 
can be correct. 

Figure 3 shows the results of scaling the IRAS data to 
the optical by this formula. The agreement in shape is out- 
standingly good for all wavenumbers k > 0.06/iMpc _ . Of 
particular interest is the inflection around k = 0.1 /iMpc -1 . 
This had been pointed out for optical galaxies by Baugh 
& Efstathiou (1993), and now appears to be confirmed by 
the IRAS data - which are derived from a more local vol- 
ume than the deeper APM data. The power spectrum is of 
the form of an almost exact power law for k > 0.15 /iMpc~\ 
but flattens quite abruptly for smaller wavenumbers. As dis- 
cussed above, there is clear disagreement between the optical 
and scaled IRAS data for k < 0.05 ftMpc -1 ; rather than be- 
ing surprised by this, we should consider it remarkable that 
the data agree so well to scales this large. Real-space mea- 
surements obtained by deprojection will always be prone to 
systematic errors on large scales, where the true small spa- 
tial signal can easily become swamped in projection. To see 
which determination to believe (if either) we should turn to 
clustering measurements in redshift space. 

2.3 Clustering in redshift space 

There are two sets of large-scale power-spectrum data for 
optical galaxies: from the CfA2 survey (Vogeley et al. 1992) 
and the APM-Stromlo survey (Loveday et al. 1992). In the 
latter case, the power spectrum has been converted from 
cell-count variances as in PD94, but this agrees very well 
with the recent direct calculation by Tadros & Efstathiou 
(1996). For IRAS galaxies, separate determinations have 



4 J. A. Peacock 




Figure 3. The real-space power spectrum of optical galaxies as 
estimated directly from the APM data of Fig. 1 (filled circles) 
and via a smoothly-varying scaling of the IRAS data from the 
same Figure. There is a good degree of unanimity concerning the 
inflection around k ~ 0.1 /iMpc -1 , but the two estimates diverge 
for k < 0.06 /iMpc" 1 . 



been published by Fisher et al. (1993) and Feldman, Kaiser 
& Peacock (1994). However, it is probably better to use 
the result of Tadros & Efstathiou (1995), which combined 
the 1.2-Jy and QDOT datasets. Figures 4a and 4b compare 
these redshift-space power spectra with the real-space data 
from the previous Section, and different behaviour is im- 
mediately apparent. For optical galaxies, the redshift-space 
data lie above the real-space results at small k, but fall be- 
low them at large k. This is qualitatively the behaviour that 
would be expected: on large scales apparent 3D clustering is 
enhanced by coherent infall (Kaiser 1987), whereas on small 
scales it is reduced by the smearing of 'fingers of God' due 
to virialized random velocities. This trend is not really seen 
in IRAS galaxies; on small scales, the real-space power is the 
larger as expected, but on large scales the IRAS real-space 
measurements exceeds the redshift-space data by almost a 
factor of 2. 

This is clear evidence that the largest-scale IRAS 
real-space data are too high; the three points with k < 
0.05 feMpc -1 will therefore be omitted from all subsequent 
analysis. For optical galaxies, Figure 4a suggests that the 
APM deprojection has worked extremely well out to the 
largest scale for which it is possible to compare with the 
shape of the redshift-space spectrum (k ~ 0.03 h Mpc -1 ). 
The data considered by PD94 included other tracers on 
these scales, such as Abell clusters, which showed a large- 
scale slope consistent with the APM determination out to 
slightly larger scales (k ~ 0.02 h Mpc -1 ), and the APM data 
appear to be reliable within their quoted errors up to this 
scale. Although results are given to smaller wavenumbers, 
we will take the conservative approach of not using them 
in the absence of independent verification. The conclusion 
of this comparison is therefore that the data shown in Fig- 
ure 3 define the power spectrum of optical galaxy clustering 
to a precision of about 20 per cent over three decades in 
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Figure 4. Comparison between the real-space power spectra of 
Fig. 1 and redshift-space data. The real-space data are shown as 
solid circles in both panels. Fig. 4a shows redshift-space results 
from the APM-Stromlo survey (open squares) and the CfA2 sur- 
vey (crosses). Fig. 4b shows the redshift-space results from the 
combined 1.2-Jy and QDOT surveys as open squares. For opti- 
cal galaxies, the redshift-space results are higher on large scales, 
as expected from peculiar-velocity distortions. The opposite is 
true for IRAS galaxies, suggesting that the largest-scale real-space 
points have been overestimated. 

scale: 0.02 < k < 20/iMpc~ 1 . With results of this verified 
accuracy now available, it should be possible to reach well- 
constrained conclusions about allowed models. 

2.4 Redshift-space distortions 

Before leaving this subject, it is interesting to look at the 
redshift-space distortions in a little more detail, since the 
size of the effect is sensitive to Q. PD94 gave an analytical 
approximation for the distorting effects of peculiar veloci- 
ties on power spectra, which expresses the power ratio as 
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a product of the large-scale Kaiser factor and a small-scale 
damping term: 

-frcdshift 



-frcal 



= [1 + f3(i 2 } 2 D{k(ia p 



(11) 



the power spectrum is anisotropic (fj, — k • f), and this ex- 
pression should be averaged over /j,. The parameter describ- 
ing the large-scale distortion is 

= V ; (12) 

the damping term depends on the assumed small-scale ve- 
locity distribution. For an exponential pair-wise distribution 
with pair- wise rms a p , the functional form is 



D[kfia p ] = [1 + {kno p f/2}- 



(13) 



(Ballinger, Peacock & Heavens 1996). For describing the 
effect on the power spectrum, we need a v in length units 
rather than velocity (i.e. a p — > a p /Ho). From measure- 
ments of pairwise velocities, sensible values for a p will lie 
in the range 3-4 h' 1 Mpc (Davis & Peebles 1983; Mo, Jing 
& Borner 1993; Fisher et al. 1994). The formula for the 
redshift-space power spectrum ignores quasi-linear modifi- 
cations, and this assumption has been questioned by Fisher 
& Nusser (1996). However the above relation only amounts 
to a linear boost of power at small k, with a reduction in 
power at small scales that depends quadratically on k. If 
the expression is not pushed to too small scales, and if we 
are not too interested in the interpretation of the exact nu- 
merical size of the damping term, then this two-parameter 
formula should always be able to describe the data well. 

Allowing the parameters (3 and a p to float, a maximum- 
likelihood fit can be performed between the real-space and 
redshift-space data (omitting the three largest-scale IRAS 
points). The real-space data, having smaller errors, are used 
to interpolate values at the k values of the redshift-space 
data, and the ratio is compared with the model. The re- 
sulting contours are shown in Figure 5. The optical results 
are far better constrained than the IRAS, reflecting the re- 
stricted range of k in the latter case, and there is a very 
clear preferred model: 



/3o P t ^ 0.40 

a p ~ 3.5 h^ 1 Mpc. 



(14) 



Note that the value of a p is very reasonable, despite the 
above caveats. 

From the earlier discussion of relative bias, we would 
expect (3 for IRAS galaxies to be a factor 1.15 larger than the 
optical figure. If anything, the IRAS plot appears to favour 
smaller figures, but probably not too much confidence should 
be placed in this, because of the demonstrated problem with 
the large-scale real-space data. The worst points have been 
removed, but some residual systematics may remain in the 
retained points of largest scale, and these are the ones which 
give the main signal for f3. 

How seriously should the low value of (3 opt be taken? 
A good reason for taking this analysis cautiously is that 
the statistical treatment of power-spectrum errors is rather 
simplistic. The published data points have been treated as 
independent, whereas it is inevitable that they will be cor- 
related to some extent (Feldman et al. 1994); a more com- 
plete analysis would have to employ the full covariance ma- 



o 
ft 




a) Optical 



-Prcal 



= 1 + 2/3/3 + /T/5; 



(15) 



0.2 0.4 0.6 0.8 1 




Figure 5. Likelihood contours (interval 0.5 in ln£) from fitting a 
model with large-scale Kaiser distortion plus fingcr-of-God damp- 
ing to the data of Fig. 4. The parameter a p is the pairwise velocity 
dispersion of galaxies divided by ifrj , and thus expressed in units 
of h^ 1 Mpc. Fig. 5a shows the results for optical galaxies; Fig. 5b 
shows IRAS galaxies with the three largest-scale real-space points 
omitted. Note that /3iras — 1.15/3opt is expected. Even allowing 
for possible systematics in the analysis, it is very hard to see how 
(3 = 1 can be allowed. 

trix. Nevertheless, the formal error in /3 op t from this analy- 
sis is very small (about 0.05), and it is more likely that we 
should be worrying about systematics. The applicability of 
the Kaiser factor has been adequately tested in simulations, 
so the main concern is whether the amplitudes of either the 
real- or redshift-space data could be incorrect. The averaged 
linear boost from the Kaiser factor is 

Prcdshift 



for (3 = 1 this is a factor 28/15 as against 1.30 for (3 = 0.4. 
(3 = 1 would therefore be allowed if the real-space power was 
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overestimated by a factor 1.43, or the redshift-space data too 
low by a similar amount. It is implausible that such a large 
factor can be found. The APM data were scaled by a factor 
of 1.25 to allow for the effects of evolution (see above). In de- 
tail, this number was obtained from a comparison with the 
amplitude of the real-space clustering in the APM-Stromlo 
redshift survey. Loveday et al. (1995) quote a scale-length 
of ro = 5.1 ± 0.2 ft -1 Mpc, or an rms uncertainty in power 
of 7 per cent - much smaller than the error needed to allow 
(3—1. The other worry is that different surveys may sample 
galaxies of differing luminosities and thus obtain a variety of 
clustering signals. Indeed, Loveday et al. (1995) suggested 
that galaxies with — 19 < Mb < — 15 were clustered a factor 
roughly 2 more weakly on large scales than were more lumi- 
nous galaxies. Is it possible that such an effect has reduced 
the amplitude of the optical redshift-space data? Certainly, 
the APM angular data refer to galaxies several magnitudes 
fainter than the redshift surveys. The real-space clustering 
signal therefore comes from a larger volume, but it is not 
likely to be due to galaxies of significantly different luminosi- 
ties. Without applied redshift limits, the selection function 
of magnitude-limited galaxy surveys peaks for luminosities 
around L* , and indeed the scaling tests for w(8) as a func- 
tion of depth applied by Maddox et al. (1990; 1996) leave 
little room for a depth-dependent effect. We therefore as- 
sume that the normalization uncertainty of 7 per cent in 
power is the principal contributor to the systematic error in 
Popt ■ Added in quadrature to the statistical uncertainty, this 
gives the minimal error bar 



(3. 



'opt 



0.40 ±0.12. 



(16) 



This uncertainty is approximately four times smaller than 
that obtained from a fuller analysis of the APM-Stromlo sur- 
vey by Tadros & Efstathiou (1996), but it is not necessarily 
unrealistic. Tadros & Efstathiou considered k < 0.1 /iMpc -1 
only, which increases the noise, and we have considered here 
both the APM-Stromlo and CfA2 surveys. 

In any case, a figure of (3 op t — 0.4 is certainly consistent 
with a number of recent determinations from measurements 
of clustering anisotropy within a survey. Ratcliffe et al. 
(1996) find (3 opt = 0.55 ±0.12 from the Durham/UKST sur- 
vey, and Loveday et al. (1996) obtain /3 opt = 0.48±0.12 from 
the APM/Stromlo survey. From the IRAS 1.2-Jy survey, 
Cole, Fisher & Weinberg (1995) obtain (3 1KAS = 0.52 ±0.13. 
This convergence on a value of around (3 = 0.5 is in very seri- 
ous disagreement with the values around unity inferred from 
the comparison of peculiar velocities with the galaxy den- 
sity field (Dekel 1994), and it is of the greatest importance 
that the discrepancy be resolved. The impression from the 
present work is that (3=1 would require gross errors in clus- 
tering measurements, and it seems implausible that this can 
be the correct value. The lower values suggest that O ~ 0.2 
- 0.3 if optical galaxies trace mass, or require b ~ 2 in an 
Einstein-de Sitter universe. 



3 NONLINEAR MASS POWER SPECTRA 
3.1 Analytical method 

In order to compare the clustering data with theoretical 
models, it is necessary to have an accurate means of predict- 
ing the effects of nonlinear growth on a given linear power 



spectrum. This was made possible by the insight of Hamilton 
ct al. (1991), who suggested a scaling formula for the growth 
of correlation functions. PD94 extended this method to work 
with power spectra and also to universes with Q, 1. The 
original suggestion of Hamilton et al. was that their scaling 
formula was spectrum independent, but Jain, Mo & White 
(1995) showed that this was not true. Peacock & Dodds 
(1996; PD96) give an improved version of the PD94 method 
which takes this into account, allowing the nonlinear spec- 
trum that results from any smoothly-varying linear spec- 
trum to be calculated. The method may be summarized as 
follows. 

The nonlinear spectrum is a function of the linear spec- 
trum at a smaller linear wavenumber: 



A£ L (fe»L)=/NL[A2(k.)], 



[l + AL(fcNL 



r 1/3 fc, 



(17) 



(18) 



PD96 give the following fitting formula for the nonlinear 
function: 



/nl(^) — X 



1 + B(3x + [Ax] c 



1 ± ([Ax] a g 3 (Q) /[Vx 1 / 2 ])^ 



1//3 



(19) 



In this expression, B describes a second-order deviation 
from linear growth; A and a parameterise the power-law 
which dominates the function in the quasilinear regime; V 
is the virialization parameter which gives the amplitude of 
the /nl(s;) tx x 3 ^ 2 asymptote (where the behaviour enters 
the 'stable clustering' limit); (3 softens the transition be- 
tween these regimes. The parameters and their dependence 
on spectrum are 



A = 0.482(l + n/3)~ 
B = 0.226 (l + n/3)~ 
ct = 3.310 (1 ± ti/3)~ 
13 = 0.862 (l + n/3)~ 
V = 11.55 (l + n/3)~ 



(20) 
(21) 
(22) 
(23) 
(24) 



For linear spectra which are not a power-law, particularly 
the CDM model, PD96 suggested that a tangent spectral 
index as a function of linear wavenumber should be used: 



n eff (fc L 



rflnP 
d In k 



(k = fc L /2). 



(25) 



The factor of 2 shift to smaller k is required because the 
tangent power-law at fc L overestimates (underestimates) the 
total degree of nonlinearity for curved spectra where n e ff de- 
creases (increases) as k increases; the results are not greatly 
sensitive to the exact shift. This prescription is able to pre- 
dict the nonlinear evolution of power-law and CDM spectra 
up to A 2 ~ 10 3 with an rms precision of about 7 per cent. 

Note that the cosmological model is present in the fit- 
ting formula only through the growth factor g, which governs 
the amplitude of the virialized portion of the spectrum. Ac- 
cording to Carroll, Press & Turner (1992), the growth factor 
may be approximated almost exactly by 



«?(fi) = -n m [{l 4 n ( 7 - fi v + (1 + O m /2)(l + Ov/70)] 1 , (26) 
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where (m) and vacuum (v) contributions to the density pa- 
rameter are distinguished; Q without a subscript generally 
means f2 m . When considering non-zero vacuum energy, it is 
usual to restrict attention to spatially flat models only. Mod- 
els with £1 < 1 thus come in two varieties: open (Q v = 0) 
and flat = 1 — fl m )- 

3.2 CDM models and normalization of power 
spectra 

As an illustration of these techniques in action, it is inter- 
esting to consider the CDM spectrum, which is a popular 
choice for the empirical description of the power spectrum, 
being a smoothly-curving form parameterised by a normal- 
ization as and a shape F*. The CDM power spectrum is 
A 2 (fc) oc fc n+3 T 2 ; usually n = 1 is adopted for the primor- 
dial spectrum. We shall use the BBKS approximation for the 
transfer function (which unfortunately suffered a misprint in 
PD94): 



T fc = 



ln(l + 2.34g) 

2.Uq X 
[1 + 3.89q + (16.1g) 2 + (5.46g) 3 + (6.71g) 4 



(27) 



-1/4 



where q = (k/h Mpc _1 )/r*. The shape parameter T* would 
be equal to Qh in a model with zero baryon content. PD94 
showed that there was a significant shift as a function of S7 B , 
which was generalized to models with Q 1 by Sugiyama 
(1995): 



r* = nh ex P [-n B (i + i/n)]. 



(28) 



Because of this scaling, F* = 0.94r, where V is the shape 
parameter defined by Efstathiou, Bond & White (1992). 

The normalization is specified by as: the linear-theory 
rms density contrast when averaged over spheres of radius 
8 ft" 1 Mpc: 



2 

or 



[sin kR — kR cos kR] 2 . 



(kR) 



(29) 



a\ is just A 2 (k) at some effective wavenumber, which is very 
well fitted by 



fceff//iMpc = 0.172 + 0.011[ln(r*/0.34)] / 



(30) 



What normalization is appropriate? A strong constraint 
comes from the abundance of massive clusters of galaxies, 
which is exponentially sensitive to the normalization, in a 
way that depends very little on spectrum. White, Efstathiou 
& Frenk (1993) argue that this gives 

-0.55 



as = 0.57 fi~ 



(31) 



to a tolerance of roughly 10 per cent. They consider spa- 
tially flat models only, but the scaling for open models is 
likely to be very similar. Notice that the content of this 
equation is very similar to the effect of redshift-space dis- 
tortions. The apparent (but non-linear) value of 6 op tcr8 is 
about 1, so the cluster constraint implies /3 opt — 0.57. This 
is a further reason for thinking that there must be a serious 
problem with the values of f3 ~ 1 deduced from velocity-field 
studies. This also says that the cluster abundance constraint 
gives a very similar value for the normalization to that ob- 
tained from small-scale pairwise velocity dispersions, since 
they scale with Q in almost the identical way. Taken literally, 
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Figure 6. The optical-galaxy power spectrum of Fig. 3 with 
the discrepant scaled IRAS points omitted (APM data are 
shown as solid points, scaled IRAS data as open circles for 
k > 0.05 ft Mpc" 1 ). Also shown as open circles for smaller 
wavenumbers are the redshift-space data from Fig. 4, corrected 
assuming /3 op t = 0.4, a p = 3.5 ft -1 Mpc. CDM models with 
T* = 0.25 and as = 0.6 & 1 are shown. Linear spectra are shown 
dotted, with nonlinear spectra shown solid. For the lower normal- 
ization, Q = 1 is assumed; for the higher normalization, f2 = 0.3 
open and flat models are shown (open being the higher). The 
unbiased models greatly exceed the small-scale data; the Q = 1 
model would require bias which was a strongly non-monotonic 
function of scale. 

the White et al. normalization would imply that optically- 
selected galaxies are antibiased for £1 < 0.3, whereas M/L 
ratios in clusters would prefer f2 ~ 0.15 - 0.2 in the absence 
of bias. We shall ignore this slight uncertainty and treat low- 
density models under the assumption that optically-selected 
galaxies trace mass as a first approximation: as = 1. For 
Einstein-de Sitter models, however, this assumption would 
make no sense and a lower normalization must be adopted; 
we take as = 0.6. Small variations on these assumptions do 
not change the conclusions. 

Figure 6 shows the predictions of CDM models with 
these normalizations and the shape parameter V* = 0.25 
claimed by PD94 to fit the shape of large-scale structure 
very well. The Einstein-de Sitter model and Q — 0.3 mod- 
els (open and flat) are shown, and none fare very well. As 
expected, the low-density models follow the low-fe cluster- 
ing data rather well, but both open and flat models start to 
predict too much small-scale power for k > 0.2 h Mpc -1 , 
and they do not produce much of an inflection around 
k ~ 0.1 ft Mpc" 1 . The open and flat models fare equally 
badly in this respect, only starting to part company at 
k > lh Mpc" 1 . Changing Q, would only change the pre- 
dicted spectrum in this regime, and so would not cure the 
problem. This tendency for low-density unbiased CDM mod- 
els to overpredict the small-scale power was noted in the 
T* = 0.2 case by Efstathiou, Sutherland & Maddox (1990), 
and recently rediscovered by Klypin et al. (1996). Because 
of its lower normalization, the Q — 1 model at least does not 
exceed the data, but it still presents serious problems. The 
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Figure 7. An extended form of Fig. 6, showing CDM spectra 
with a) r* = 0.15, b) T* = 0.35, for erg values between 0.1 and 
1, in steps of 0.1. Linear spectra arc dotted; the solid lines show 
nonlinear spectra for Q = 1, and the dashed lines show an open 
Q = 0.3 model. 

implied bias is an oscillating function of scale: it has a local 
maximum around k ~ 0.1/iMpc -1 (again, no inflection is 
predicted), and a local minimum around k ~ 3/iMpc^ 1 . No 
local bias scheme could give this non-monotonic behaviour 
(Coles 1993). 

These problems are general, and not specific to the ex- 
act value of T* used, or the tilt adopted. For models with 
n / 1, the results over the range of k considered are indis- 
tinguishable from those with n = 1 and a different value 
of r*. Figure 7 shows a grid of models with varying erg for 
T* = 0.15 and 0.35, and they show that the same kind of 
problems persist: 

(1) Unbiased models with erg ~ 1 over-produce small-scale 
power. The correct level of power around k = 1 h Mpc -1 
requires erg ~ 0.7, but the large-scale data are then very 



badly fitted, whatever value of T* is assumed. 

(2) No model reproduces the inflection seen around k ~ 
O.lhMpc- 1 . 

(3) Only open low-density models produce a small-scale 
power spectrum which resembles the observed power 
law. Both £1 — 1 models and flat low-density models 
reach the stable clustering regime and turn to a flatter 
slope rather abruptly at A 2 ~ 50 - 100. These models 
need a bias scheme capable of generating a sharp kink 
at just the required point. 

The reason PD94 were not able to reach these con- 
clusions is because of the improved nonlinear treatment 
used here, which allows high accuracy in the predictions at 
0-5 < k < lOhMpc -1 . PD94 considered only the deduced 
linear spectrum at < 0.5 /iMpc -1 , which reduces the ampli- 
tude of the conflict between CDM models and observation. 
Comparison of nonlinear spectra is clearly the more sensi- 
tive test, given accurate nonlinear predictions; the nonlinear 
treatment of PD96 in fact gives a slightly larger response for 
low-r* CDM models, which exacerbates the difficulties for 
CDM. The conclusion is therefore that the CDM spectrum 
is of the wrong shape, and cannot be made to fit the data, 
however its parameters are adjusted. 

3.3 Empirical linear power spectra 

What appears to be required is a linear spectrum which 
bends more abruptly around k ~ 0.1/iMpc~ , and the in- 
flection at this point is therefore to be interpreted as a resid- 
ual feature from the linear spectrum. The precise linear form 
can be recovered by running the nonlinear apparatus back- 
wards, although this is now a more complicated procedure 
than in PD94, because the nonlinear response depends on 
the slope of the linear spectrum. We have to proceed iter- 
atively, assuming an initial linear spectral slope, and then 
refining the dependence of this slope on scale. 

For low-density models, it is reasonable to perform this 
linearization under the assumption that the clustering of 
optically-selected galaxies is not significantly biased. Mod- 
els with f2 = 1 present greater difficulties: there is no point 
in recovering a linear spectrum which would evolve into the 
observed clustering when we know that the resulting ampli- 
tude of mass fluctuations would be far too high. Before per- 
forming the linearization, we need to allow for bias, but this 
requires a specific hypothesis for the variation of bias with 
scale. Since we do not know what this variation is, the best 
that can be done is to illustrate how the linearized spectrum 
depends on different assumptions for the bias. The nonlin- 
ear response is by definition highly sensitive to the linear 
spectrum, so there is at least some hope that the recovered 
linear spectrum will not depend critically on assumptions 
about bias. We shall consider two examples. The simplest is 
a constant linear bias: 

A 2 pt = b 2 AL ss ; (32) 

in the light of the earlier discussion, b = 1.6 is about the 
correct number. This is certainly an extreme example, since 
we have seen that bias generally increases at small scales. 
For an alternative model, we can take a hint from nature 
and guess that the scale dependence of the bias is similar to 
the form that relates optical galaxies and IRAS galaxies: 

l + A 2 opt = [l + biALj' 2 (33) 
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Non-linear bias models of this type are considered in de- 
tail by Mann, Peacock & Heavens (1996). Their work shows 
that a broad range of biasing prescriptions yield a galaxy 
power spectrum with a relationship to the mass power spec- 
trum which is well fitted by such a functional form. For 
IRAS/optical bias, we had b\ ~ 62, and so this suggests 
(61, fe) = (1-6, 1.6), for an overall linear bias of 1.6. There is 
no guarantee that this is the exact bias that applies to the 
real universe, but it will suffice to illustrate the effects of a 
realistic amount of scale-dependent bias. Figure 8 shows the 
recovered linear spectra for these various cases. 

In all cases, what we see is a spectrum which breaks 
rather abruptly, as expected from the discussion of the fail- 
ures of the CDM spectrum. As hoped, the f2 = 1 reconstruc- 
tions with different bias assumptions differ only slightly. For 
k < lftMpc -1 , a satisfactory empirical description of the 
data is provided by a simple transition between two power 
laws: 

2 _ (k/kor 

A W ~ [1 + (*/*!)(«»-«/*]«' W 

as shown in Figure 9. For low-density unbiased models, the 
parameters are 

k = 0.42 ftMpc -1 
fci = 0.057 ftMpc -1 

a = 0.74 (35) 
(3 = 4.0 
<5 = 0.6. 

A value of f3 = 4 corresponds to a scale-invariant spectrum 
at large wavelengths, whereas the effective small-scale in- 
dex is n = —2.3. Note that here and below we focus on 
= 0.3 as a representative low-density model, but almost 
indistinguishable linearized results would be obtained if this 
was varied by up to a factor 2 (unless bias was varied as a 
function of O) . For Q. — 1 , the parameters change to 

fco = 5.0 /tMpc -1 
fci = 0.066 feMpc -1 

a = 0.42 (36) 
13 = 4.0 
8 = 1.0, 

independent of the assumed scale dependence of bias for 
linear wavenumbers k < 0.4 h Mpc -1 . These fits are sim- 
ilar in form to a model advocated by Branchini, Guzzo 
& Valdarnini (1994). On the basis of a redshift survey of 
the Perseus- Pisces region, they suggested a phenomenolog- 
ical power spectrum in which the small-scale index was 
n = -2.2, with a break at k ~ 0.08 h Mpc -1 . They did not 
consider the nonlinear evolution of any alternative model, 
but this was an early indication that a very flat small-scale 
spectrum might fit the clustering data. 

It should be noted that this linearization process is close 
to being unstable, because the nonlinear response rises very 
rapidly as the linear spectrum approaches n = —3. The 
small-scale index cannot greatly exceed n = —2.3, but it 
could be flatter. This is an issue for the low-density mod- 
els in particular, as can be seen in Fig. 8a. The analytic fit 
is not perfect, with the data lying slightly higher around 
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Figure 8. Linearized versions of the optical power-spectrum data 
from Fig. 3, with APM results shown as solid points and scaled 
IRAS results as open circles. The solid lines are the power-law fits 
described in the text. Fig. 8a assumes no bias and shows results 
for open and flat £2 = 0.3 models. These are similar on large 
scales, but the flat results require an abrupt steepening on small 
scales. Fig. 8b assumes £1 = 1 and a linear bias of b = 1.6. If this 
is independent of scale, a small-scale feature is again required. For 
the more plausible scale-dependent bias described in the text, the 
linear spectrum is more nearly a single power law on small scales. 

k = 0.1/iMpc -1 and lower around k — 0.3 h Mpc -1 . It is 
not possible to achieve a better fit, since this would require 
a local n ~ — 3 between k ~ O.lfeMpc -1 and 0.2/iMpc -1 . 
Such a flat spectrum would suppress the required power at 
k = 0.3ftMpc _1 , resulting in a peak in the linear spec- 
trum at k ~ 0.1 ft Mpc -1 . This is a fascinating possibility, 
but needs further investigation; the PD96 formulae have not 
been tested in the regime where the concept of a hierarchi- 
cal density field breaks down. In any case, the errors on 
the data are such that the apparent misfit in Fig. 8a is of 
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questionable significance. For the present, the conservative 
approach is to stick with a monotonically increasing A 2 (fc), 
and the linearization in Fig. 8a therefore assumes the local 
slope n(k) given by the analytic fit. 

The fits work well enough at large scales, but in most 
cases they fail to fit the reconstruction at the smallest scales, 
and there is generally a 'bump' where the power increases 
quite abruptly. This can be understood with reference to the 
nonlinear spectra shown in the discussion of the evolution 
of the CDM spectrum. High-density and flat models with 
A ^ tend to hit the virialized regime quite early, flattening 
the predicted power spectra for A 2 > 100. This is not seen 
in the data, and so a feature must be introduced into the 
linearized spectra in order to keep the nonlinear spectrum 
steep. For open models, this feature is not required for O < 
0.3. For Q — 0.3, the small-scale flattening is just starting 
to become significant, and smaller values would arguably 
be preferred. However, as discussed below, the higher value 
gives a better match to the data on clustering evolution. The 
fitting formula is easily amended to deal with this small-scale 
behaviour: 

(k/k ) a + (fc/fc 2 ) 7 



A'(fc) = 



[l + (fc/fci)(«-0)/*]*' 



(37) 



A value of 7 = 3 (n = spectrum) is appropriate, and 
the required normalization is ki — 0.72 ftMpc -1 for the flat 
= 0.3 case (increasing to 0.79 for £1 = 0.2), or fc 2 — 
0.91/iMpc -1 for fl = 1. The spatial scales of interest for 
the small-scale feature, where correlations in the range 100 
- 1000 are found, are separations of about 0.5 - 0.2 h^ 1 Mpc. 

The alternative explanation is that the mass spectrum 
does flatten on these scales, but bias steepens it again. This 
is hard to rule out, but it is does seem a little contrived 
that the effects of bias and gravitational nonlinearity should 
conspire to cancel each other out in this way. The only case 
where such a conspiracy is not required is when the uni- 
verse is open with fi < 0.3; the virialized regime then oc- 
curs at sufficiently high overdensity that the predicted non- 
linear spectrum remains almost a single power law up to 
A 2 > 1000, without having to introduce a feature in the 
linear spectrum. This point is illustrated in Figure 9. 

3.4 Physical models 

What is the interpretation of the linear spectrum? We have 
argued that CDM models are inadequate for describing any- 
thing but the very large-scale portion of the spectrum, and 
so the temptation to put a physical meaning to the best- 
fitting shape parameter through T* ~ Qh may not always 
be correct. Nevertheless, many of the alternative models pro- 
duce less small-scale power than a CDM model of a given 
density, so a lower limit to flh may be obtained by this ar- 
gument. In this case, it is worth noting that a substantial 
density is implied: fi < 0.2 would be difficult to reconcile 
with any reasonable Hubble constant. 

Within the compass of models where the dark matter 
is still cold and collisionless, there is only one alternative in 
the literature that produces a spectrum with the required 
sharp break: isocurvature initial conditions (Efstathiou & 
Bond 1986). However, in this case the large-scale rms CMB 
anisotropics would be a factor of 6 higher for a given level of 
large-scale matter fluctuations, and so this possibility must 
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Figure 9. Illustrating the fit of the two power-law model for the 
linear spectrum in the case of an Q = 0.3 open model. For the 
same linear spectrum, Q = 0.3 flat and Q = 1 models produce 
successively more marked flattening of the small-scale nonlinear 
spectrum. In these latter cases, a small-scale feature in the lin- 
ear spectrum is required in order to keep the nonlinear spectrum 
steep. 

Table 1. Best-fitting parameters for physical power spectra. 
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n 
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CDM 
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0.109 





CDM 


0.3 


0.090 





HDM 


1 


0.440 


l 


HDM 


0.3 


0.429 
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MDM 


1 


0.373 


0.307 


MDM 


0.3 


0.393 


0.335 



be rejected. 

Assuming that 'designer inflation' and features in the 
primordial spectrum are rejected, we are left with the alter- 
native of modifying the matter content. Interesting possibil- 
ities with high density are either mixed dark matter (e.g. 
Holtzman 1989; Taylor & Rowan-Robinson 1992; Klypin et 
al. 1993), or non-Gaussian pictures such as cosmic strings + 
HDM, where the lack of a detailed prediction for the power 
spectrum helps ensure that the model is not yet excluded 
(Albrecht & Stebbins 1992). It has been argued that an 
MDM spectrum with Qh ~ 0.5 and approximately 30 per 
cent of the density in a hot component gives a spectral shape 
quite close to the empirical one. This would be true whether 
or not the total density was fi = 1, so one could also con- 
sider low-density MDM models. Pogosyan & Starobinsky 
(1995) give analytic formulae for the MDM transfer func- 
tion, which they express as T M dm = T C DM(k,h)D(k, ft, fi„), 
assuming £1 — 1. If we remove this assumption, then the 
formulae can still be used, replacing k/h 2 by [k/h]/F* and 
f2„ by f v — Q v /Q. Note that there appears to be a misprint 
in the definition of the parameter f3 after their equation (3): 
it should be /3 = [5 - (25 - 24!^) 1/2 ]/4. 

Fig. 10 shows the fit of this 2-parameter form to the lin- 
ear power data, and contrasts this with the poor fit of CDM 
alone. This plot also compares pure HDM models, restricting 
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Figure 10. The linearized spectrum data from Fig. 8, fitted 
with CDM (dashed), HDM (dotted) and MDM (solid) models. 
The best- fitting parameters are given in Table 1. MDM spectra 
provide the best fit, although HDM works well over the range 
to which the fit was restricted (k < 0.2/tMpc- 1 for HDM; 
k < 1/iMpc -1 in the other cases). 



the power spectrum. 

However, both HDM and MDM models face problems 
that are generic to any model with a very flat high-A; spec- 
trum in a high-fi universe: difficulty in forming high-redshift 
objects. These are classic problems for HDM models, but 
also appear to be fatal for MDM (Mo & Miralda-Escude 
1995; Ma & Bertschinger 1994; Mo & Fukugita 1996). As 
shown above, high-density models also have difficulty in 
achieving a steep correlation function on small scales; both 
this point and high-z galaxy abundances argue for an ad- 
ditional small-scale component in the linear spectrum, if 
n = 1. Both these difficulties disappear in an open uni- 
verse, but it is hard to feel much enthusiasm for the ugly 
combination of MDM and £1 < 1. Apart from aesthetics, 
we have seen that the value of V* in this model is much 
higher than would be required in a universe with reasonable 
h and f2 ~ 0.3. In summary, all known physically-motivated 
models for the clustering power spectrum face very serious 
difficulties. It appears that some completely new alternative 
is needed. 



4 EVOLUTION OF CLUSTERING 

Having identified a variety of routes by which the present- 
day clustering of galaxies could have arisen, the obvious way 
to break the degeneracy is to look at the change of cluster- 
ing with epoch, for which the various models make quite 
different predictions. For a number of years, the evolution 
of clustering has been probed by means of the angular corre- 
lation function, w(8), and its change with magnitude limit. 
Limber's equation (e.g. Peebles 1980) allows a relation to be 
made between w{&) and £(r) once the redshift distribution 
is known for the galaxies under study. A number of studies 
have been carried out, of which some of the more recent are 
Efstathiou et al (1991); Neuschaefer, Windhorst & Dressier 
1991; Couch, Boyle & Jurcevic (1993); Roche et al. (1993). 
Although there has been some debate over the interpretation 
of these studies owing to the uncertain redshift distribution 
at very faint limits, there has been a measure of agreement 
over two conclusions: 

(1) The slope of the correlation function appears not to 
alter with redshift. If £(r) oc r~ 7 , then 7 is in the region 
of 1.7 at all redshifts up to z ~ 1. 

(2) The rate of evolution of clustering is relatively rapid, 
close to the linear-theory rate of evolution in an 
Einstein-de Sitter universe: £(r, z) oc (1 + z)~ 2 . 



the fit to the large-scale portion of the spectrum. Both HDM 
and MDM achieve a satisfactory fit for k < 0.2 h Mpc -1 , and 
both spectra in fact make similar nonlinear predictions, un- 
derlining the earlier discussion on the instability of the lin- 
earization at large k. The parameters for these models are 
given in Table 1. The preferred values of F* for HDM and 
MDM are something of a puzzle, since they are in the region 
of 0.4. This is too large to be consistent with low-density 
models for any reasonable Hubble constant, and would re- 
quire h ~ 0.4 for fi — 1. Many would also think that this is 
an unreasonable number, but it does at least yield a reason- 
able age for Q = 1 models. The Einstein-de Sitter universe is 
thus the only case where a consistent picture for the present 
universe can be made using an a priori physical model for 



4.1 The predicted rate of evolution 

Several workers have commented that these conclusions are 
paradoxical. A common model for predicting the evolution 
of clustering is Peebles' stable-clustering argument. In the 
limit that clusters are virialized entities of fixed proper size, 
clustering will change with epoch only because the back- 
ground density evolves as (1 + z) 3 . In proper length units, 
this predicts £{r p ,z) oc (1 + z)~ 3 . In comoving units, used 
everywhere else, one is then led to write 

£(r, 2 ) = [r/roH(l + z)- (3 ^ +e) , (38) 

where e = is stable clustering; e = 7 — 3 is constant comov- 
ing clustering; 6 = 7 — 1 is fl = 1 linear-theory evolution. 
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It is usually argued that biased CDM-like universes would 
therefore predict e significantly less than zero, whereas the 
data favour e ~ 1. An alternative way of looking at this is 
to ask how the typical proper size of a cluster changes with 
redshift: R p cc (1 + z)^ 13 . Clearly, e = (7 — 3)/3 and so e ~ 1 
implies that f3 ~ — 1: clusters must contract about as fast as 
the universe expands. 

In short, the paradox is that galaxy clustering is re- 
quired to evolve in a way that resembles linear theory in an 
Q — 1 universe, keeping the same shape and changing its am- 
plitude rapidly, even though the data are well into the non- 
linear regime. Since the above analysis says that clustering 
should then evolve more slowly, this has led some (e.g. Efs- 
tathiou et al. 1991) to suggest that the galaxies seen in the 
faint surveys are a different population, with much weaker 
intrinsic clustering. 

However, there is a serious loophole in the stable- 
clustering argument, since we have seen that the observa- 
tions of clustering mainly do not reach the stable-clustering 
regime. In the intermediate quasilinear transition to linear 
behaviour, PD96 have shown that the steep form of /ml pro- 
duces evolution of clustering that is much more rapid: 
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Figure 11. The linear growth in density perturbations, scaled 
to unity at the present, for Q = 1, 0.3 & 0.1. Open models are 
shown as solid lines, fiat models arc dashed. 



£(r,z) = [r/roP [D(z)\ 



(6-2 7 )(l+a)/3 



(39) 



where a ~ 3.5 - 4.5 is the transition slope in / NL and D{z) 
is the linear-theory density growth rate (D(z) = (1 + z)^ 1 
if SI = 1). For a = 4, 7 = 1.7, this gives £ oc D 4 ' 3 , which 
is about twice the observed evolution if fi = 1. Ironically, 
we still conclude that the observed clustering evolution is 
inconsistent with nonlinear evolution in an fi = 1 universe - 
but now because the real evolution is too slow, not too fast. 
Agreement with the data requires a linear growth rate which 
is about half as rapid as the fi = 1 D(z) oc (1 + z) 1 - Fig- 
ure 1 1 shows the linear-theory growth factor plotted against 
scale factor for various values of fi for open and flat models 
(see e.g. Carroll et al. 1991). This suggests that the observed 
rate of evolution can be supplied naturally by fi ~ 0.2 - 0.3 
[open] or 0.1 - 0.2 [flat], without having to postulate that 
the faint galaxies are a separate population. 

We can do better than this general discussion by look- 
ing at the detailed data on the evolution of clustering from 
the Canada- France Redshift Survey (Le Fevre et al. 1996). 
From a sample of 591 galaxies selected to I < 22.5, the 
CFRS team are able to deduce the spatial clustering in sev- 
eral redshift bands out to z = 1. They avoid redshift-space 
effects by working with the projected correlation function 
discussed in the context of IRAS galaxies. Le Fevre et al. 
call this quantity w(r p ), but we shall use the symbol H(r) 
as before. The only important difference is that Le Fevre 
et al. quote their results as a function of proper separation, 
whereas comoving units are preferred here. It will also be 
more convenient to use the dimensionless function H(r)/r, 
which is then directly related to overdensity and so does 
not change as the length units are scaled from proper to 
comoving. 

The other technicality to worry about is that the de- 
duced values of H(r)/r depend on the cosmological model 
used. The CFRS results are quoted for fi = 1; inspection of 
the integral relating S to £ says that the projected results 



must scale in the following way for other cosmologies: 
Hi (ri) _ dxi Dn gn(rn) 



rn 



(40) 



where the increment of comoving distance is 
R 0dx - 



(41) 



yU, + fi m (l + zf + (1 - fi m - fi„)(l + z) 

D — RoSk(x) is comoving angular-diameter distance, and 
Ro = (c/Ho)\l — fi m — fi„|~ 1//2 . At a given r, H/r also de- 
pends on cosmology because ri/rn = Di/Dq. Overall, this 
shift and the multiplicative factors cause a larger H/r to 
be inferred for low fi. Assuming a slope of 1.65, the total 
shift of power at z = 1 is a factor 1.4 (fi = 0.3 open) or 
1.9 (fi = 0.3 flat); these models require an evolutionary e 
which is smaller by 0.5 and 0.9 respectively. There is thus 
something of a cosmic conspiracy: the low-density models 
in which dynamics yields less rapid clustering evolution are 
also those in which geometry causes less rapid evolution to 
be inferred from a given set of data. 

4.2 Comparison of models and CFRS data 

The results of a comparison between the CFRS data and 
various models are shown in Fig. 12. This shows that the 
projected correlations are very close to a single power law on 
all nonlinear scales, and that the amplitude changes by only 
a factor ~ 2 between z — 0.34 and z = 0.86. The solid lines 
in Fig. 12 are the predictions of the double power-law spectra 
deduced from the local data, and they show that it is not 
easy to satisfy the twin constraints of the right shape and the 
right rate of evolution. All models tend to flatten on small 
scales as clustering reaches the virialized regime. For fi = 0.3 
(open), this tendency is relatively minor, but for flat models 
it is a stronger effect, and produces a huge discrepancy in 
the case of fi = 1. If we are to live in an fi = 1 universe, then 
the two powers of ten discrepancy in small-scale clustering 
at high redshift revealed by Fig. 12c must be bridged by bias. 
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Figure 12. The CFRS projected clustering data (calculated as 
a function of comoving separation for three cosmologies) in three 
redshift ranges, compared to model predictions. The solid lines 
show the evolution predicted for the double power-law fits to the 
linear power spectra shown in Fig. 8. The dashed lines show the 
effect of adding an additional small-scale component to the linear 
spectrum, such as is needed to account for the present-day clus- 
tering spectrum on the smallest scales. In the case of the f! = 1 
model, b = 1.6 at the present was assumed in order to obtain the 
mass spectrum. However, unless there is an additional small-scale 
component, the implied small-scale bias at high redshifts is very 
much greater than this, on scales where the spectrum is only 
mildly nonlinear. 

This is implausible, not just because of the large amplitude 
of the bias, but because of the scale on which it occurs. 
If we ask what power spectrum is required to produce the 
observed power-law clustering, and compare it with the Q = 
1 nonlinear prediction at z = 0.86, there is a discrepancy 
of a factor ~ 100 at the point where the prediction has 
A 2 ~ 1. This conflicts with the general idea that local bias 
schemes would yield a near-constant bias where the mass 



fluctuations are in the linear regime. If bias of this magnitude 
were a reality, it would mean that the galaxy distribution at 
high redshifts is dominated by non-local effects which bear 
little relation to the mass distribution. The known existence 
of massive high-redshift clusters (Luppino & Gioia 1995) is 
probably sufficient to rule out this idea. 

Alternatively, we may take seriously the small-scale 
'kick-up' in the linear spectrum that is required if the idea of 
strongly scale-dependent bias is rejected. The dashed lines 
in Fig. 12 show the results of adding in this feature, which 
changes the picture so that the variation between different 
cosmologies is now much reduced. The hypothetical addi- 
tional component dominates the clustering in the £1 = 1 
case, and greatly increases the small-scale clustering in the 
A-dominated universe. We now see the standard stable- 
clustering evolution on most scales, with something more 
rapid on the largest scales. The stable-clustering rate in the 
A-dominated case is in reasonable accord with the data as 
calculated for this model, as expected from the above dis- 
cussion. The same would also be true of the Q — 1 model 
if the same degree of bias (b = 1.6) applied at z = 1 as at 
z = 0. However, the model that comes closest to matching 
the overall shape and normalization of high-redshift cluster- 
ing is the open universe. The main difficulty for this model 
is that it over-predicts the clustering at z = 0.34, as does the 
flat model. Le Fevre et al. point out that the galaxies in this 
bin are of low luminosity only, so this may not be a problem. 
The highest-redshift bin contains the galaxies most nearly 
comparable to the local samples, and this is well matched in 
amplitude if Q ~ 0.3. The preferred range for f2 depends on 
how seriously the CFRS errors are taken, but models out- 
side the range f2 = 0.2 - 0.5 give a poor fit if we stick to the 
assumption that there is little bias. 

Given the realistic uncertainties in the clustering data, 
there is not much to choose between these latter alternatives. 
Le Fevre et al. find a difference in clustering amplitude of a 
factor 1.8 between their red and blue sub-samples at inter- 
mediate redshifts, and the difference between models is of 
this order. One way of progressing from this point will be 
via data at higher redshifts, where the evolution of the dif- 
ferent alternatives continues to diverge. Fernandez-Soto et 
al. (1996) have argued from clustering of quasar absorption- 
lines that the scale-length for the correlation function at 
z = 2.6 is r ~ 0.4 /iT 1 Mpc (for fi = 1). The models with 
additional small-scale features in the linear spectrum would 
predict a much larger number, since evolution would have 
to proceed at the stable-clustering rate. If we could be con- 
vinced that the absorbers are not strongly antibiased, this 
result would therefore argue in favour of open models. 

To sum up the conclusions of this Section, open uni- 
verses with a linear spectrum in the form of a simple break 
between two power laws predict the correct form and rate of 
evolution of galaxy clustering up to z = 1. To obtain com- 
petitive results in f2 = 1 or flat universes, we must believe 
that the linear power spectrum contains an extra feature, 
which comes to dominate for k > 1 ftMpc -1 . 

5 CMB ANISOTROPIES 

A consistent model must match the normalization of the 
mass fluctuations on large scales inferred from fluctuations 
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in the Cosmic Microwave Background. In making this com- 
parison, it is important to be clear that the CMB fluctua- 
tions depend only on the very large-scale P oc k n portion of 
the spectrum. Predictions of smaller-scale fluctuations such 
as the amplitude as then need additional information in the 
form of the transfer function. Rather than quoting the as 
implied by the CMB, it is therefore clearer to give the large- 
scale normalization separately. 

The large-scale normalization from the 2-year COBE 
data in the context of CDM-like models is discussed by 
Bunn, Scott & White (1995); White & Bunn (1995); and 
Stompor, Gorski & Banday (1995). The final 4-year COBE 
data favour slightly lower results (Bennett et al. 1996), and 
we scale to these in what follows. For scale-invariant spec- 
tra and fi = 1, the best normalization of the primordial 
spectrum is 

A 2 (fc) = (fc/0.0737/iMp C - 1 ) 4 , (42) 

equivalent to Qrms = 18.0 fiK, or e = 3.07 x 10 -5 in the no- 
tation of Peacock (1991), with an rms error in density fluc- 
tuation of 8%. For low-density models, a naive analysis as in 
PD94 suggests that the power spectrum should depend on fi 
and the growth factor g as P oc c? 2 /fi 2 . Because of time de- 
pendence of gravitational potential (integrated Sachs- Wolfe 
effect) and spatial curvature, this expression is not exact, al- 
though it captures the main effect. From the data of White 
& Bunn (1995), a better approximation is 

A 2 (fc)(x^<7 - 7 . (43) 

This applies for low-fi models both with and without vac- 
uum energy, with a maximum error of 2% in density fluc- 
tuation provided fi > 0.2 (and gives the same as values 
as Gorski et al. (1995), when the appropriate F* corrections 
are made, to within 3%). Since rough power-law approxima- 
tions for g are g ~ fi 0,65 and fi 0,23 for open and flat models 
respectively, we see that the implied density fluctuation am- 
plitude scales approximately as fi -0 ' 12 and fi~ 0,69 for these 
two cases. The dependence is very weak for open models, 
but vacuum energy implies very much larger fluctuations. 
Note that these conclusions can be reached without being 
specific about the generation of anisotropies in open models. 
The existence of a curvature scale destroys the possibility of 
a unique scale-invariant spectrum, but such a concept does 
have a meaning on smaller scales. Since the main COBE 
signal comes from multipoles with £ > 10, the ambiguity in 
the predicted power from very low £ is unimportant unless 
fi < 0.1. 

To avoid large uncertainties caused by different transfer 
functions, it is interesting to compare this COBE normaliza- 
tion with the observed power on the largest reliable scales: 

A 2 pt(fc = 0.02 h Mpc -1 ) ~ 0.005. (44) 

For this scale, the COBE scale-invariant prediction is 0.0054. 
Reducing fi boosts this by a factor of 1.3 (open fi = 0.3) 
or 5.3 (flat fi = 0.3). The open number is well consistent 
with the observations, given the uncertainty in the transfer 
function at this point; the fi = 1 number is about three times 
too high (because the optical power should be roughly 1.6 2 
times that of the mass in the fi = 1 case); the flat number 
is too high by at least a factor 6. The last two cases would 
therefore require a tilted spectrum. 



According to the simplified analysis of PD94, COBE 
determines the spatial power spectrum at an effective 
wavenumber of 0.0012 fi 5 ft Mpc" 1 , where 5 = 1 for open 
models, 0.4 for flat. The tilt needed to match the power at 
k = 0.02ftMpc _1 is then n = 0.64 for fi = 1, or n = 0.48 
for the flat fi = 0.3 case. If gravity waves are included with 
the usual inflationary coupling between wave amplitude and 
tilt, the effect is approximately 

A 2 oc [1 + 6(1 -n)]' 1 (45) 

Allowing for gravity waves therefore changes the required 
tilt to n = 0.85 (fi = 1) or n = 0.75 (fi = 0.3 flat), but 
the conclusion remains that low-density flat models need 
an extremely large degree of tilt in order to be viable. An 
open model therefore gives the most straightforward match 
between COBE and large-scale structure. It is interesting to 
note that Ratra et al. (1996) and Gorski et al. (1996) also 
claim that the degree-scale anisotropy data are best fitted 
by an open model with fi = 0.3 - 0.4. 



6 DISCUSSION 

This paper has argued that the state of observations in 
galaxy clustering is now one of very high precision, both in 
the local power spectrum and in its evolution. Comparing 
this body of data with the predictions of nonlinear gravi- 
tational instability has yielded a number of general conclu- 
sions: 

(1) There is some evidence that more power is measured in 
redshift space than in real space, but not by as large a 
factor as expected for f3 = 1. A value /3 ~ 0.5 is more 
consistent with the value of its inferred from the cluster 
abundance. 

(2) The shape of the power spectrum is not consistent with 
nonlinear evolution of any CDM-like model. The lin- 
ear spectrum must have a sharp break around k ~ 
0.1 ft Mpc -1 . 

(3) Only low-density open models give a small-scale spec- 
trum which is a power law. For other models, either a 
feature in the linear spectrum or an abrupt increase in 
small-scale bias is needed. 

(4) Open models with fi ~ 0.3 also naturally give a spec- 
trum which evolves at the observed rate while main- 
taining the same power-law shape. Such models are fur- 
thermore the only alternatives that are consistent with 
CMB anisotropies without requiring a large tilt. 
These conclusions have been reached without invoking 

a specific mechanism for how galaxies trace mass. For low- 
density universes, we have been content to use M/L argu- 
ments to infer that the empirical degree of bias today is 
probably small, whereas substantial bias must exist if fi = 1. 
A more complete picture would have to be able to calculate 
the amount of bias and how it changes with redshift, but this 
is not well constrained at present. For example, Efstathiou 
(1995) showed how faint-galaxy clustering of the required 
amplitude could exist at z — 1 in an fi = 1 CDM universe 
by explicitly following the formation of low-mass haloes of 
dark matter in a numerical simulation, and identifying these 
with galaxies at that time. The problem with this argument 
is that different choices of halo mass (and halo environment) 
produce huge variations in the clustering amplitude, so this 
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route does not yet give a robust prediction for the cluster- 
ing of faint galaxies. This ability to obtain very different 
clustering properties from different plausible ideas for how 
and where galaxies form warns us to take great care in in- 
terpreting clustering data; even so, it is undeniably striking 
how closely an open universe in which galaxies nearly trace 
mass accounts for a variety of aspects of galaxy clustering 
and its evolution. 
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